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Abstract 

There are many familiar situations in which a manager seeks to design a system in which users share 
a resource, but outcomes depend on the information held and actions taken by users. If communication is 
possible, the manager can ask users to report their private information and then, using this information, 
instruct them on what actions they should take. If the users are compliant, this reduces the manager's 
optimization problem to a well-studied problem of optimal control. However, if the users are self-interested 
and not compliant, the problem is much more complicated: when asked to report their private information, 
the users might lie; upon receiving instructions, the users might disobey. Here we ask whether the manager 
can design the system to get around both of these difficulties. To do so, the manager must provide for the 
users the incentives to report truthfully and to follow the instructions, despite the fact that the users are self- 
interested. For a class of environments that includes many resource allocation games in communication 
networks, we provide tools for the manager to design an efficient system. In addition to reports and 
recommendations, the design we employ allows the manager to intervene in the system after the users 
take actions. In an abstracted environment, we find conditions under which the manager can achieve the 
same outcome it could if users were compliant, and conditions under which it does not. We then apply 
our framework and results to design a flow control management system. 
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I. Introduction 

There are many situations in which a manager seeks to design a system for users to share a 
resource, optimizing it according to some given benevolent or selfish criterion. If the manager has 
full information and users cannot act independently of the manager, the manager's problem is one 
of optimal control and is well-studied. If the users have information the manager does not have 
and act independently of the manager, but communication between the users and the manager is 
possible and users are compliant, the manager's problem is only slightly more complicated: the 
manager can simply ask the users to report their private information and then provide instructions 
on how it wishes them to behave. Because the users are compliant, they will report truthfully and 
obey instructions, so, whatever the manager's objective, this again reduces to a known problem 
in optimal control. However, if the users are self-interested and strategic, two difficulties arise. 
The first is that the users might lie about their private information - if it is in their individual 
interests to do so; the second is that the users might disobey the instructions of the manager - 
if it is in their individual interests to do so. The manager's problem in this setting is to design 
a system to maximize its objective function, given the self-interested and strategic nature of the 
users. A case of particular interest is that of a benevolent manager, who seeks to allocate resources 
efficiently or fairly according to some measure of social welfare. Efficient resource allocation is 
crucial to make the system accessible to many users and provide each of them with good service. 
However, the problem faced by a benevolent manager may be no easier than the problem faced 
by a selfish manager, who maximizes some measure of its own personal welfare, because the 
strategic interests of the individual users will be different from the interests of the group of users 
as a whole, and hence may still lead individual users to lie and to disobeyO 

In the economics literature, such problems are formalized in terms of mechanism design [fTTI - 
|[T6l . The usual approach is to design a system in which the users make reports to the manager 
on the basis of their private information, the manager provides instructions to the users based on 
these reports, and the users then take actions that maximize their own welfare. A version of the 
revelation principle lfT6l implies that such systems can always be designed so that the users find 

'Even in the absence of private information, the strategic interests of the individual users usually lead to the over-use of resources 
and to substantial inefficiencies 171. t2l. 
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it in their own self-interest to report truthfully and act obediently. We merge such an approach 
with the innovation introduced by ifTTl . and applied to situations of medium access control 0, 
|[T8ll . and power control [4], by allowing for intervention by the manager ^ That is, we allow the 
manager, in addition to designing a system of reports and instructions, to deploy an intervention 
device that intervenes after the users take actions. The action of this intervention device depends 
on the reports and the actions of the users, and it follows an intervention rule designed by the 
manager. The intervention device adds to the manager's ability to provide incentives for the users 
to report truthfully and obey instructions by threatening punishments if users lie and/or disobey. 

In this paper we explore the manager's problem in a class of abstract environments that exhibit 
some features common to many resource sharing situations in communication networks, including 
power control 0, flU, medium access control (MAC), 0, O, and flow control IO-0I1- We will 
characterize a coordination mechanism, i.e., a system of reports, recommendation and intervention, 
that is optimal (from the point of view of the manager) among all mechanisms. We provide 
conditions on the environment under which it is possible for the manager to achieve its benchmark 
optimum - the outcome it could achieve if users were compliant - and conditions under which it 
is impossible for the manager to achieve its benchmark optimum. Although we can characterize 
the optimal mechanism, other mechanisms are also of interest, for several reasons. The optimal 
mechanism may be very difficult to compute, and hence to execute. It is therefore of some 
interest to consider mechanisms that are sub-optimal but easy to compute, and we provide a 
simple algorithm that converges to such a mechanism. Moreover, in some situations, it may not be 
possible for the users to communicate with the manager, so it is natural to consider intervention 
schemes that do not require the users to make reports. Finally, we apply these results in the 
context of flow control. Computations show, among other things, that the considered schemes can 
considerably increase the efficiency of the system. 

There is by now a substantial communication engineering literature that addresses the problem 
of providing incentives for strategic users to obey a particular resource allocation scheme. Some of 
this literature adopts pricing schemes that charge users for their resource usage. Pricing schemes 
can be divided into two categories: pricing for strategic users lfT9l -||22 l and pricing for distributed 



2 A packet-dropping scheme that follows the same philosophy as intervention was proposed for flow control games in Q. 
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algorithms yl, 11231 . [|24|. The former is used for scenarios where the users are self-interested 
and strategic, as in our scenario. Such users are required to pay real money for their resource 
usage. If the manager knows how a payment affects the utility of a user, it can give the incentives 
to the user to adopt a particular resource allocation scheme by setting the right prices. Such 
pricing schemes may achieve the goal of optimal levels of resource usage, but suffer from the 
following drawbacks: (1) the users are forced, "by contract", to pay depending on their resource 
usage and on the state of the system^; (2) the manager has to know the users' monetary valuation 
for the service; (3) a secure infrastructure to collect the money is needed. Pricing for distributed 
algorithms is used for scenarios where the users are compliant and game theory is used as a tool 
to obtain an efficient distributed algorithmic In this case the users accept passively the utilities 
imposed by the manager, that incorporate a term that represents a cost, even though the payments 
do not actually need to be carried out. The distributed algorithm is obtained forcing the users 
to act as selfish agents that maximize such utilities, using for example a best response dynamic. 
Game theory allows to foresee the outcome of this interaction, and the manager has to design the 
users' utilities to obtain a desired outcome. 

A different literature, including 0|, Q, 1(171 . lfT8l . adopts the intervention schemes considered 
here. Intervention differs from pricings in that it operates inside the system while pricing operates 
outside the system: both schemes provide the manager with a tool to alter the utility of users, 
but intervention affects resource usage - and hence utility - directly, while pricing affects utility 
indirectly, through payments. Thus, intervention is more robust than pricing: users cannot evade 
intervention but they might be able to evade monetary charges, moreover, the manager does not 
need to know the users' monetary valuation for the service in intervention schemes. 

So far, both intervention and pricing schemes have mainly been applied in communication 
engineering games with complete information, i.e., assuming that the manager knows the relevant 
information held by the users. There are few works that address the problem of extracting the 

3 Current communication networks use different business models. 

4 This is not the scenario considered in this paper, but we want to complete the discussion on pricing schemes to remark that 
they might be applied to two different scenarios and to avoid misunderstandings. 

5 Since in this paper we consider self-interested and strategic users, we implicitly refer to the first category of pricing schemes. 
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Knowledge of users' mone- 
tary valuations for the service 


Users' behaviors in 
reporting information 


Users' behaviors in 
taking actions 


This work (intervention + 
mechanism design) 


not needed 


Truthful communication 
enforced by intervention 


Actions enforced by 
intervention 


Intervention 


not needed 


Compliant users 


Actions enforced by 
intervention 


Pricing for strategic users 


needed 


Compliant users 


Actions enforced by 
payments \ contract 


Pricing for distributed 
algorithms 


not needed 


Compliant users 


Compliant users 


Conventional mechanism 
design (e.g., auctions) 


This is the information the 
users are asked to report 


Truthful communication 
enforced by the scheme 


Actions enforced by 
payments \ contract 



TABLE I 

Comparison of different incentive schemes exploited in communication engineering literature. 



relevant information from the users. Such works (e.g., Il25ll - lt27l0 apply the ideas of mechanism 
design for auctions, creating schemes that ask the users to reveal their monetary valuation for the 
service and, depending on it, to pay for their resource usage. These schemes suffer from the same 
defects as the previously cited pricing schemes: the users are forced to pay depending on their 
resource usage and a secure infrastructure to collect the money is needed. Table U summarizes 
the main differences between the above described incentive schemes used in communication 
engineering literature and our approach. 

The remainder of this paper is organized as follows. In Section HH we introduce the coordination 
mechanism model, using Myerson's framework lfT6l as the reference. In Section Hill we study the 
properties of the optimal mechanism. In Section |IVj we consider two suboptimal mechanisms 
which, under some assumptions, are easier to compute with respect to the optimal mechanism. In 
Section |Vl we analyze and we show the results for the flow control game, both in the complete 
and incomplete information frameworks, and with and without intervention. Section IVTl concludes 
with some remarks. 
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II. A GENERALIZED COORDINATION MECHANISM FOR PRIVATE INFORMATION PROBLEMS 

We consider a manager that wants to design a system whose resources will be used by n users, 
J\f = {1,2, ...,n} denoting the set of users. Each user might have private information that the 
manager cannot observe and might take an action that the manager cannot directly control. We 
denote by Tj = {r^i, 7^2, Ti, mi } C R, rrii G N, the finite set of user i's private information, in 
which the elements are labeled in increasing order, i.e., 7^1 < Tj 2 <, < Ti >nH . We denote by 
Di = [dY lm , d™ ax ] C E the set of user i's possible actions. We refer to tj G Tj and to di G -D» as 
the type and the action of user i. As an example, each action di may represent user i's level of 
resource usage, while each type U may represent i's personal valuation for the resource. We denote 
by D = Xjzj/Dj and T = Xj^Tj the set of joint action profiles and the set of joint type profiles, 
i.e., all the possible combinations of users' actions and users' types; and by D_j = Xj^j^uyDj 
and T_j = Xj^^qTj all the possible combinations of users' actions and users' types except for 
user i. Thus, the symbols t G T, d G D, t_ { G TLj and cLj G D_j represent vectors. 

We assume that the manager can instruct a device, which we refer to as the intervention device, 
that will interact with the users in the system. The aim of the manager is to design the intervention 
device so that the outcome of the system maximizes the manager's objective. The intervention 
device has three features: 1) it can communicate with users; 2) it can monitor users' actions; 3) 
it can take an action of its own, which we interpret following [fTTl as an intervention. We define 
the intervention rule f : D — >■ D = {d™ m , d™ ax ] as a function that maps an action profile to 
an action of the intervention device, and we denote by T the finite set of intervention rules that 
the intervention device can implement. For the moment (the role of the intervention device for 
incomplete information scenarios will be clear in Section III-CI) . we assume that the intervention 
device takes an action following the randomized intervention rule n designed by the manager, such 
that 7r(/) > 0, V / 6 J, and Y^feF^if) = 1- The randomized intervention rule is communicated 
to all users before they select their actions. After observing users' actions d, the intervention 
device picks an intervention rule / following the probability distribution n and intervenes with 
an action f(d). We refer to the couple (Do, J 7 ) as the intervention capability. Finally, we denote 
by U Q : T x D x T — > K the manager's utility function, and by ^ : J 7 x D x T - >IR user i's 
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utility function, where T = x ieAf Ti, % = [n t i, r i>m jE 

A. Assumptions on utilities 

We assume that the manager's utility satisfies the following assumptions, \/d E D and V£ E T, 
Al: There exists d* Q E D such that U (dQ,d,t) > U (d ,d,t), Vd G Ah 7^ 
A2: d*(t) = argmax d Uo(dl, d, t) is unique 
A3: d*(t) is differentiable with respect to tj and 8 ^ > 

Assumption Al states that d^ is the most preferred action of the manager, regardless of users' 
actions and type profile. In games where the intervention device drives users' actions by threatening 
punishments, the intervention can be interpreted as the level of punishment and d* Q as the absence 
of intervention. 

By assumption A2, for every type profile t E T and for every user i E M, the users' joint 
action profile that maximizes the intervention device's utility is unique, and by assumption A3, 
each component in d*(t) is continuous and increasing in the type of that user. If actions represent 
the level of resource usage and types represent resource valuations, assumption A3 asserts that 
the higher z's valuation the higher should be z's level of resource usage. 

For each type profile t E T, we define the game 

r°HAW^K,-,i)}r =1 ) a) 

T° is the complete information game (i.e., users know everything about the structure of the game, 
in particular, they know the types of the other users) that models the interaction between strategic 
users having types t when the intervention device adopts the action d* Q independently of users' 
actions. It can be thought as the complete information game that models users' interaction in the 
absence of an intervention device. 

We denote by d NE \t) = (d? E °(t), . . . , d% E °(t)^j a Nash Equilibrium (NE) of the game T° t , 
which is an action profile so that each user obtains its maximum utility given the actions of the 



6 We require the manager's utility to be defined over the continuous interval T C K™, that includes the finite type set T, because 
the property A3 needs a set in which the differentiation operation is defined. However, the results in the rest of the paper are 
obtained under the condition that each user i's type belongs to the finite set Ti. 
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other users, i.e., 

Ut (d* ,d NE °(t),t} > Ui (tfocMT (*),*) , Vdi e A , Vz e AT (2) 

Notice that we have a different game r°, and therefore a different NE action profile, for each 
possible type profile t G T. For this reason d NE °(t) is represented as a function of t. 

We assume that users' utilities U^d^, d, t) are twice differentiable with respect to d and, G D, 

A4: Ui{d* , d, t) is quasi-concave in di and there exists a unique best response function d BR (d_i, t) = 

argmax^ Ui(dQ, d, t) 
A5: < 

odiOdj — i-i 

A6: There exists d NE °(t) such that d NE \t) > d*(t) Q and d^ E \r k ,t^ k ) > d* k (T k ,t_ k ) for some 
users k E M and type r fc G T fe 

Assumption A4 states that T° is a quasi-concave game and the best response function d BR (d_i, t) 
that maximizes U^d^d^t) is unique. Hence, either z's utility is monotonic with respect to di, or 
it increases with di until it reaches a maximum for d BR (d_i,t), and decreases for higher values. 
As a consequence, a NE d NE °(t) of T° exists. In fact, the best response function d BR (d,t) = 
[d BR {d^i, t), . . . , d BR (d^i, t)) is a continuous function from the convex and compact set D to D 
itself, therefore Brouwer's fixed point theorem assures that a fixed point exists. 

Assumption A5 asserts that T° is a submodular game and it ensures that d BR (d-i,t) is a non 
increasing function of dj. Interpreting di as z's level of resource usage, this situation reflects 
resource allocation games where it is in the interest of a user not to increase its resource usage 
if the total level of use of the other users increases, in order to avoid an excessive use of the 
resource. Nevertheless, assumption A6 says that strategic users use the resources more heavily 
compared to the optimal (from the manager's point of view) usage level. 

The class of games satisfying assumptions A4-A6 includes the linearly coupled games [6] and 
many resource allocation games in communication networks, such as the MAC 0, jH, power 
control 0, flU and flow control ll6l- lfT0l games. Moreover, if the manager's utility is increasing 
in the users' utilities (e.g., sum-utilities or geometric mean) and the intervention represents a 



'Throughout the paper, inequalities between vectors are intended component-wise. 



9 



punishment, also assumptions A1-A3 are satisfied in these games and the absence of intervention 
represents the intervention device's preferred action o^. 

B. Actions enforcement for the complete information game 

We first introduce the framework to design incentives to enforce users' actions in the complete 
information scenario, though the main focus of this paper is the design of a system for an 
incomplete information setting, dealing both with information revelation and action enforcement. 
The notations and concepts introduced in the following will become useful later, when we study 
the incomplete information scenario. In fact, some properties of the incomplete information game 
(i.e., the game where users do not know the types of the other users) are linked to the properties 
of the complete information game defined in this Subsection. 

Given a randomized intervention rule n, we define the complete information game 

r t = (Af,A-FiMKli) (3) 

that models the interaction between strategic users having types t. The utility functions Ui(-,t) 
are the expectations, over the randomized intervention rule, of the original utilities: 

Ui (d, t)=E f [Ui (/, d, t)} = * (/ I *) Ut (/, d, t) (4) 

where n (f | t) denotes the probability that the intervention device adopts the intervention rule 
/ G J 7 given that the type profile is t, and E x [■] is the expectation operator with respect to the 
random variable 

Analogously, we denote by U the manager's expected utility 

U (d, t) = E f [U {f, d, *)] = J> (/ I t) U (/, d, t) (5) 

According to assumptions A1-A2, the manager's expected utility is maximized when users adopt 
action profile d*(t) and the intervention device adopts action d* Q . However, in a strategic scenario 
the users adopt the actions that maximize their own utilities, and the possible outcomes are 

8 There is some abuse of notation in using the same symbol to indicate a random variable and a particular realization, but this 
will not lead to confusion. 



10 



represented by the NEs. The NEs of the game T t depend on the randomized intervention rule 
selected by the manager because it affects the utilities of the users. Thus, the manager has to 
design the randomized intervention rule so that there exists a NE of the game T t that gives it the 
highest utility among what is achievable with all possible NEs. 

Definition 1. A randomized intervention rule n is said to sustain an action profile d G D in T t 
if d is a NE of the game T t , i.e., if 



If such it exists, we say that d is sustainable. 

A randomized intervention rule ir sustains an action profile d G D in T t without intervention 
if tx sustains d and f(d) = d*$ for every intervention rule f such that tt (/ | t) > oj^l If such ix 
exists, we say that d is sustainable without intervention. 

Interpreting d*$ as the absence of intervention, the expression sustainable without intervention is 
here used to indicate that in the equilibrium the intervention action is not executed. We denote by 
T d,t the set of all randomized intervention rules, obtainable starting from the intervention rule set 
J 7 , that sustain d in T t without intervention. The possibility of the manager to design a randomized 
intervention rule capable of sustaining an action profile depends on the intervention capability, 
namely, the action space D of the intervention device and the class of intervention rules T the 
intervention device is able to implement. If we expand these sets, the manager has more degrees 
of freedom in designing intervention rules capable of sustaining action profiles. 

Definition 2. (Dq,^) is an optimal intervention capability with respect to the complete informa- 
tion game T t if the maximum utility that the intervention device can obtain considering all the 
sustainable action profiles cannot be improved by expanding Dq and T. 

C. Coordination mechanism formulation for the incomplete information game 

In this paper we consider the scenario where each user has private information, which is 
synthesized in its type. Following Harsanyi's approach ll2~8ll . we study the incomplete information 

9 These definition can be easily extended for pure intervention rule: / sustains d in Tt (without intervention) if it sustains d in 
Tt (without intervention), where n (/ 1 1) = 1 if / = /, otherwise. 




(6) 



11 



scenario assuming that each user acts based on the beliefs it has about the types of the other 
users. In particular, we denote by P t (-) the joint probability distribution of the type profile over 
the type profile set T. We assume that each type profile has a positive probability to occur, i.e., 
Pt(r) > 0, Vr G T. We denote by P t _ i (r^i) the joint probability distribution of the type profile 
of all the users except for user i over the set T_j (notice that user i knows its own type, tj). We 
assume that, for each user i, P t _ l ( T -i) is consistent with Pt(-), i.e., P t _ i (T^ i ) = P t (U,T^i | U). 

To reach its objective, the manager may program the intervention device to elicit information 
from users and to spread information into the system (notice that users' behaviors, and therefore 
the outcome of the system, depend on the information they have). We denote by Ri the set of 
all reports that user i can transmit to the intervention device and by Mj the set of all messages 
the intervention device can send to user i. As usual, we denote by G Ri the report sent by 
i, by r G R = x ie ^/i?j the report profile, by mj G Mj the message sent to user i and by 
m G M = x ieJ ^Mi the message profile. The messages sent and the randomized intervention rule 
adopted by the intervention device may depend on the reports sent by users. Hence, given the 
report profile r, we denote by m s [r) = (mf (r), . . . , (r)), mf : R — >■ Mj, the messages sent 
by the intervention device and by n (/ | r) the probability that the intervention rule / G T is 
adopted. Following Myerson's terminology lfl"6ll . we refer to (i?, M, m 5 ,7r) as the coordination 
mechanism implemented by the intervention device. 

The manager has to design the coordination mechanism to drive the outcome of the system 
towards its objective. In doing so, it has to consider that users might both send reports and 
adopt actions strategically, i.e., both information revelation and action enforcement issues must 
be addressed at the same time. Once the coordination mechanism is established, the interaction 
between users can be modeled as a Bayesian game 

V— (N,$,A,T,P t ,{Ui(-,-,t)}™ =1 ) (7) 

In this context, a strategic user i selects its report r, G Ri and its action di G Di in order to 
maximize its expected utility given the information and the beliefs it has. Precisely, a strategy for 
user i consists of a couple of functions ((pi, Si), (pi : Tj — >■ Ri represents the report of user i which 
may depend on its type. Si : Mi x Tj — > D { represents the action of user i which may depend on 
its type and on the message received; in fact the received message can carry information about 
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the types of the other users, that can be exploited by i to select the most appropriate action. We 
denote by <\> = {<fii} ieA f G $ the reporting strategy profile and by 5 = {Si} i&Af G A the action 
strategy profile. 

Fig. CD represents the different stages of the interaction between the users and the intervention 
device, which are summarized in the following. 
Stage 1: the intervention device announces the coordination mechanism (i?, M, m s , ir) 
Stage 2: each user i sends a report 4>i(t) to the intervention device 
Stage 3: the intervention device sends a message m« = mf (4>i(t)) to each user i 
Stage 4: each user i takes an action di = Si {m^ti) 

Stage 5: the intervention device monitors the users' action profile d, picks an intervention rule / 
following the distribution ix (• | 4>{t)), and adopts the action f{d) 
The utility U i of each user is the expectation over the randomized intervention rule of the 
original utilities, therefore, given the strategy profiles ((f>,8), 

U i {<f>,6 i t)=E f [U z (f,6(m s (<t>(t)),t),t)] =J2^(f\ <t>(t))Ui (/, S(m s (<P(t)), t), t) (8) 

In a Bayesian game a user selects its strategy in order to maximize the expectation of its utility 
with respect to the initial beliefs about the types of the other players. The expected utility of a 
user % having type U is 

Vd4>,S,t i )=E t _ tlu \U l (<P,S,t)}= ^P t (t|^)7r(/|0(t))^(/,5(m 5 (0(t)),t),t) (9) 

The strategy profiles (0, 5) is a Bayesian Nash Equilibrium (BNE) of the game if, for each 
user i e J\f, for every type U G T» and for every alternative strategy (j>i, SA for i, 

Vi (0, 5, U) > Vi St, 6-i, ti) (10) 

10 We remark the importance of communicating the mechanism and committing to it. If the intervention device could deviate 
from the mechanism and select an action to maximize the manager's utility, then, since Al is satisfied, the intervention device 
would adopt d c * independently of users' actions. The users, foreseeing this behavior, would ignore the threat of the intervention 
device and would play as if the intervention device were not present in the system. Conversely, forcing the intervention device to 
follow the mechanism and communicating it to users, allows the manager to design credible threats and obtain better outcomes. 
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Finally, the aim of the manager is to design an optimal coordination mechanism (i?, M, m s , 7r), 
such that there is a BNE (</>, 5) that gives the manager the highest possible expected utility 

V (0, 5) = Et [E f [U (/, S{m s {<f>{t)), t), t)] } =J2H U I <t>(t)) U (/, 5(m 5 (0(t)), t), t) 

(ID 

This formulation is rather abstract, so it may be worth to use a simple illustrative example to 
remark our goal. Assume the manager has to assign a resource to 2 users. From a social point of 
view, the best choice might be to assign the resource to the user having the higher valuation for 
that resource. Using a conventional MD scheme, the manager might implement a Vickrey auction 
to obtain the users' valuations and to select the user with the higher valuation. However, if a user 
could avoid the payment such method would fail its objective because that user could bid more 
than what it is really willing to pay. Moreover, nothing would prevent the user that has lost the 
auction from trying to access the resource. That is, conventional mechanism design relies on other 
systems (e.g., a reliable infrastructure to collect money and punishments for the users that do not 
respect the agreements) to be effective. Here we want to design a scheme that does not rely on 
external systems. As an example, the intervention device might be a device that asks the users 
to report their valuations and, based on that, proposes how to share the resource. If the users do 
not respect such sharing the intervention device might jam their communication. The mechanism 
used by the intervention device to propose the resource sharing and to jam users' communication 
must be designed to provide the incentive for both the users to report their true valuations and to 
accept the proposed resource sharing. 

III. Optimal incentive compatible direct mechanisms 

The design of an optimal coordination mechanism seems to be intractable since there are no 
constraints on the sets M, and Fortunately, the revelation principle [16] allows us to restrict 
the attention to the class of incentive compatible direct mechanisms, among which the optimal 
mechanism is also optimal in the class of all coordination mechanisms. In a direct mechanism users 
report their types to the intervention device, and the intervention device sends them a suggested 
action profile, i.e., Ri = Tj and M» = A, Vz G M. We denote by d s (r) = (df(r), . . .,d%(r)), 
df : T — > Di, the suggested action profile given the reported type profile t. We say that user i is 
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honest and obedient if it reports its real type and adopts the suggested action, i.e., if 4>i(ti) = U and 
Si(di,ti) = di, for every type ij £ and suggested action di £ A- Finally, a direct mechanism 
is incentive compatible if the honest and obedient strategy profile is a BNE, i.e., if it provides 
incentives for users to behave honestly and obediently. 

The Optimal Incentive Compatible Direct Mechanism (OICDM) can be computed solving 

OICDM argmax £ £ P t (t)7r (/ | t) C/ (/, d 5 (t), *) 
subject to: 

7T (/ | *) > , |t) = l , V/67, Vt£T 

> J]) ^ t (t | t-Ott (/ | ts) Z7* (/, d^C^, *i(df ^ 4 )), t) 

Vi £ {1, ...,n} , Vri£Ti, Vfi £ Tj, : A ->• A 

The second set of constraints of OICDM represents the incentive compatible condition. It asserts 
that when i's type is Tj, 2 does at least as well by being honest and obedient as by reporting % 
and then adopting Si(df fj)) when told to adopt df(t_j,fj), assuming that the other users 
are honest and obedient. If users were compliant to the manager's instructions, the mechanism 
could be thought as a way to retrieve the relevant information, compute the optimal policy and 
recommend actions to users. In this scenario the optimal mechanism could be computed solving 
OICDM without the second set of constraints. The design of a system that is robust against 
self-interested strategic users translates mathematically in additional constraints to satisfy, which 
represent the incentives given to users to follow the instructions. For this reason, the maximum 
utility the manager can obtain with self-interested strategic users is never higher than the maximum 
utility it can achieve with compliant users. We denote by Vq' ie the maximum expected utility that 
the manager can obtain when users are compliant, i.e., 

v ME = j2muoK,d*(t),t) d2) 
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We say that a direct mechanism is a maximum efficiency incentive compatible direct mechanism 
if it is a solution of OICDM and the expected utility that the manager can achieve is equal to 
the maximum efficiency utility. 

Finally, we define the concept of optimal intervention capability also for the incomplete infor- 
mation game T. 

Definition 3. (Do,^) is an optimal intervention capability with respect to the incomplete infor- 
mation game V if the solution of OICDM cannot be improved expanding Dq and T. 

A. Properties of a maximum efficiency incentive compatible direct mechanism 

In this Subsection we address the problem of the existence and the computation of a maximum 
efficiency incentive compatible direct mechanism. 

The first result we derive asserts that a maximum efficiency incentive compatible direct mecha- 
nism exists if and only if, for every type profile t, the optimal action profile d*(i) is sustainable in 
the game with complete information T t , and users have incentives to reveal their real type given 
that they will adopt d*(t) and the intervention device does not intervene. If this is the case, we 
are also able to characterize all maximum efficiency incentive compatible direct mechanisms. 

Proposition 1. (T, D, d s , 7r) is a maximum efficiency incentive compatible direct mechanism if 
and only if Vt G T, 

1: the optimal action profile d*(t) of the game T t is sustainable without intervention in T t ; 
2: each user has incentives to report its real type, when other users do it and everybody is 
adopting the optimal action profile d*(t) and the intervention device never intervenes, i.e, 

Pt{t\r i )U i {dldr{t),t)> Pt{t\r i )U i {dld\f l ,t. i ),t) 

VzGAT, Vr.GT, Vf.GT, (13) 

3: the suggested action profile is the optimal action profile of game T t , i.e., d (t) = d*(t); 

4: the randomized intervention rule sustains without intervention d*(t) in T t , i.e., n(- | t) G 

jrd*(t),t 



Proof: See Appendix lAl 
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Conditions 1-2 are related to the structure of the game without intervention device, while 
conditions 3-4 say how to obtain a maximum efficiency direct mechanism once 1-2 are satisfied. 

In the second result we combine condition 2 of PropositionQ]with assumptions A3-A6 to derive a 
sufficient condition on users' type set structures under which a maximum efficiency incentive com- 
patible direct mechanism does not exist. We define the bin size (3 k of user fc's type set, T k , as the 
maximum distance between two consecutive elements of T k : /3 k = max s£ {i v .^ mfc _i} (T k , s +i — T kiS ). 
We define the bin size f3 as the maximum between the bin sizes of all users: f3 = max fcgA/ - f3 k . 

Proposition 2. There exists a threshold bin size ( > so that if f3 < ( then a maximum efficiency 
incentive compatible direct mechanism does not exist. 

Proof: Let k G TV and r k G T k be such that d^ E \r k ,t^) > d* k (r k ,t_i), Vt_i G T_j. We 
rewrite condition 2 of Proposition \T\ for users k and type r k : 

Pt{t\r k )U l {dl,d*{t),t)> £ P t {t\r k )U i {dld*{t k ,t_ k ),t) , \/f k eT k , (14) 

i~ fc eT_ fc t- fc eT_ fc 

We have d^ R (d*_ k , t- k , r k ) > d^ R (d^\t. k ,r k ) = d^ E °(t. k ,r k ) > d> k (t- k ,n), where the first 
inequality is valid for the submodularity. 

Let f k (t- k ) be the type r so that d*(r,t- k ) = d k R (d*_ k ,t_ k ,r k ) if it exists (in this case A3 
guarantees it is greater than r k ) and it is lower than t k , and f k (t- k ) = t k otherwise. Let f k = 
Tma t _ k f k (t- k ). If (r k ,f k ]f)T k ^ (in particular, this is true if (3 < r k - r k = Q, Vr m G 
(r k ,f k ] f) T k we obtain 

U k (do, d*(t- k , T m ),t-k, T k ) > U k (do, d*(t- k , r k ), t- k T k ) , Vt_ fc G T_ fc (15) 

contradicting Eq. (TT4l) . ■ 

Interpretation: when user fc's type is r k , fc's resource usage that maximizes the manager's utility, 
dl(r k ,t_ k ), is lower than the one that maximizes fc's utility, d k R (d*_ k ,T k ,t_ k ), Vt_fc G T_ fc . If k 
reports a type r m slightly higher than r k , then the intervention device suggests a slightly higher 
resource usage, allowing k to obtain a higher utility. Hence, k has an incentive to cheat and 
resources are not allocated as efficiently as possible. To avoid this situation, the intervention 
device might decrease the resources given to a type r m . In this case the loss of efficiency occurs 
when the real type of k is r m and it does not receive the resources it would deserve. There is 
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no way to avoid the loss of efficiency associated to both case tk = Tk and case tk = r m , both 
occurring with positive probability. 

It is worth noting that we consider finite type sets and a finite intervention rule set mainly 
to simplify the logical exposition. However, all results might be derived also with infinite and 
continuous setsHJln particular, if type sets are continuous Proposition [2] implies that a maximum 
efficiency incentive compatible direct mechanism never exists. 

B. Properties of optimal incentive compatible direct mechanisms 

If a maximum efficiency incentive compatible direct mechanism exists, the optimal incentive 
compatible direct mechanisms set coincides with the maximum efficiency incentive compatible 
direct mechanisms set, that is characterized in Proposition [Q However, finding an optimal incentive 
compatible direct mechanism in the general case, solving OICDM, may be computationally hard. 
In this Subsection we consider some additional conditions to simplify the problem. First we 
assume that the manager's utility is a function of the users' utilities. Moreover, we suppose that 
the intervention capability (D , J 7 ) is such that, for each type profile t £ T, every action profile 
d E D lower than the NE action profile of the game T° is sustainable without intervention in T t 
(i.e., d < d NE (t) implies T d + non empty). Finally, we assume that, for each type profile t E T 
and for every action profile d E D, the utility of a user i adopting the lowest action d™ m is 
equal to 0, i.e., U^d^d™™ ,d-i,t) = 0. Interpreting d^ m as no resource usage, this means that, 
independently of types and other users' actions, a user that does not use resources obtains no 
utility. 

Lemma 3. The utility of user i is non increasing in the actions of the other users. 
Proof: 

Ufa i, t) = U M , 0, ,u t) + £ = jT 9U >^% X 

du,(dz,d,t) _ ["' a 2 u,(d' ,x,d,t) gx < 



ddj J dxddj 



"For the continuous case, probability distributions and sums must be substituted with probability density functions and integrals. 
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where the inequality is valid for the submodularity. ■ 
The following result allows the manager to further restrict the class of mechanisms to take into 
consideration. 

Lemma 4. There exists an optimal incentive compatible direct mechanisms such that, V£ G T, 
the randomized intervention rule sustains the suggested action profile without intervention in T t . 

Proof: See Appendix |B] ■ 
Lemma |4] suggests the idea to decouple the original problem, OICDM, into two sub-problems. 
First we can calculate the optimal suggested action profile d s (t) under the constraint that users 
adopting that action profile have incentives to report their real type. Finally, it is sufficient to 
identify an intervention rule able to sustain d s (t) without intervention in T t . This is formalized 
in the following. 

Consider the mechanism (t, D, d S ', If) , where 
-,s 



d = argmax^ P t {t)U (d* Q , d s (t),t) 



dS ter 



subject to: 

£ P t {t\T i )U i {dl,d s {t- h T i ),t)> Pt{t\r l )U l {dld^{t. l ,f l )Mdf{t. l} f l )),t 

Vie{i,...,n}, Wj e T, h v^gT,, : > (17) 



and, WteT, 



?f(- | t) G F dS ' 1 (18) 



Proposition 5. The mechanism (t, D, d S , tt j is an optimal incentive compatible direct mechanism. 



Proof: Eq. (fT8l) says that we are looking for a mechanism where, Vt G T, the randomized 
intervention rule sustains the suggested action profile without intervention in T t . Moreover, the 
constraint of Eq. (fTTT) says that the users have the incentive to reveal their true types if they adopt 
the suggested action profile. Lemma |4] states that such a class of mechanisms is optimal, hence, 
the solution of Eqs. (fT7l)-(fT8l gives an optimal incentive compatible direct mechanism. ■ 
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Corollary 6. The intervention capability (D , J 7 ) is optimal with respect to Y. 

IV. Sub-optimal incentive compatible direct mechanisms 

In this Section we provide practical tools for the manager to design efficient coordination 
mechanisms. Although we have characterized the optimal mechanism, other schemes are also of 
interest, for several reasons. First of all, the optimal intervention scheme may be very difficult 
to compute, even in the decoupled version of Eqs. (fT7T)-([l"8T). It is therefore of some interest 
to consider intervention schemes that are sub-optimal but easy to compute. Moreover, in some 
situations, it may not be possible for the users to communicate with the manager, so it is natural to 
consider intervention schemes that do not require the users to make reports. In the following, we 
address both issues. In Subsection IIV-AI we describe an algorithm that converges to an incentive 
compatible direct mechanism where the recommended actions are as close as possible to the 
optimal ones. In Subsection IIV-BI we consider a mechanism that is independent of users' reports. 

A. Algorithm that converges to an incentive compatible direct mechanism 

In this Section we propose a general algorithm (see Algorithm [B that converges to an incentive 
compatible direct mechanism. Such algorithm is run by the intervention device at the beginning of 
the interaction with the users in order to obtain the mechanism to adopt. After that, the interaction 
between the intervention device and the users is as usual: the intervention device communicates the 
mechanism, the users report their type, the intervention device suggests the actions to adopt, the 
users take actions, and finally the intervention device monitors users' actions and intervenes. This 
algorithm can be applied when the suggested action profile, for every type profile t and at each 
step of the algorithm, is sustainable without intervention in T t . The suggested action profile will 
never be lower than the optimal action profile d* (t) and higher than the NE action profile d NE it) 
of L°, so it is sufficient that J** is non empty Vi G T and Vd G D so that d*(t) <d< d NE \t). 

We denote by the expected utility that user i, with type U, obtains reporting type 

and adopting the suggested action, assuming that the other users are honest and obedient, i.e., 
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where we used the notation i — m, . . . , U-i,U, t i+ i, . . . , t n ). 

The algorithm has been designed with the idea to minimize the distance between the optimal 
action profile d*(t) and the suggested action profile d s (t), for each possible type profile t. To 
explain the idea behind the algorithm we use Fig. |2l where i's utility is plotted with respect to i's 
action, for a fixed type profile t and assuming the other users adopt the suggested actions d^it). 

The algorithm initializes the suggested action profile d s (t) equal to the optimal action profile 
d*(t) and selects a randomized intervention rule n (• | t) that sustains it without intervention, 
for every type profile t E T. This situation is represented by the upper-left Fig. |2] Also the 
NE and i's best response action are represented, d NE °{t) and d BR (d s _ i (t)). By assumption A6 
d*{t) < d NE °{t) and by assumption A5 d NE °(t) < d BR (d^(t)), because d^(t) < d NE \t). If 
Wi(ti,ti) > Wi(ti,ti), for every alternative i's reported type tj, then user i has an incentive to 
report its true type U. If, at a certain iteration of the algorithm, this is valid for all users and for 
all types they may have, then the algorithm stops and an incentive compatible direct mechanism 
is obtained!^ 

Conversely, suppose there exists a user i and types U and U such that Wi(U, U) < Wi(ti, U), i.e., 
user i has the incentive to report U when its type is U. Then the suggested action df(t) is increased 
by a quantity equal to e^, moving it in the direction of the best response function df R (d s _ i (t)), for 
every possible combination of types t_i of the other users, and updates the randomized intervention 
rule 7T (■ | t) in order to sustain without intervention the new suggested action profile. This has the 
effect, as represented by upper-right Fig. [2l to increase JJi (o^, d s (t),t), Vt_j G T_j, and therefore 
also the expected utility of i when it has type t« and it is honest, W(ti,ti). This procedure is 
repeated as long as Wi(ti,ti) < and df(t) < df E °(t). In case i's suggested action df(t) 

reaches df E °(t) and still Wi(ti,ti) < Wi(ti,ti), then the suggested action of user k, df(t), is 
increased by a quantity equal to e k , V k E M, k ^ i, V t_j E T„j. As we can see from lower- 
left Fig. [2l this means to move the best response function d ER (d s _ i (t)) in the direction of the 
suggested action df{t). If df(t) reaches d^ E °(t) as well, V ' k E M, then d£ i (t) coincides with the 

12 Notice that, if a maximum efficiency incentive compatible direct mechanism exists, since it must satisfy the conditions of 
Proposition Q] then the initialization of the algorithm corresponds to a maximum efficiency incentive compatible direct mechanism 
and the algorithm stops after the first iteration. 
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best response function d BR (d s _ i (t)), as represented in the lower-right Fig. [2] In fact, by definition, 
the NE is the action profile such that every user is playing its best response action against the 
actions of the other users. Since df(t) coincides with d BR (d s _ i (t)), Vt_j G XI j, user i is told to 
play its best action for every possible combination of the types of the other users. Hence, user 
i cannot increase its utility reporting a different type tj, therefore the mechanism is incentive 
compatible. 

The algorithm stops the first time each user has the incentive to declare its real type. Since at each 
iteration the suggested action profiles are increased by a fixed amount, the algorithm converges 
after a finite number of iterations. The higher the steps e,, i G M, the lower the convergence time 
of the algorithm. On the other hand, the lower the steps, the closer the suggested action profile 
to the optimal onellf 

Algorithm 1 General algorithm. 
i: Initialization: Vi G X, d s (t) = d*(t), it (• | t) G J 7 ^'*. 
2: For each user i G M and each couple of states t^ti G Xj 

3: If Wi(ti,ti) < Wi(ti,ii) 

4: If df(t h t-i) < df E \u, t-i) for some t_ 4 G T_i 

5: df(t i: U) <- min [dffaU) + e», df it (■ | t) G F dS >\ Vt_, G T_, 

6: Else 

7: df{U,U) <- min { + e fc , d^(ti,t_i)}, vr (• | t) G J 7 ^'*, E N, k ^ i, 
Vt_, G T_. t 

8: Repeat from 2 until 3 is unsatisfied Vi, tj, £_j 



B. A priori direct mechanism 

In this Subsection we consider a new type of mechanism, namely an a priori mechanism, 
where users' reports do not play any role for the final outcome. This is particularly useful in 



13 Notice that, since no assumption such as convexity is made for the manager's expected utility Vo, an action profile closer to 
the optimal one does not necessarily imply a better outcome for the manager. 
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situations where it is not possible for the users to communicate with the manager. However, 
also for scenarios where users can send reports, an a priori mechanism might represent a good 
sub-optimal mechanism that is efficient and easy to compute. 

Definition 4. (T, D, d s , it) is an a priori direct mechanism if it is a direct mechanism and the 
suggested action profile d s and the selected randomized intervention rule n do not depend on 
users' reports. (T, D,d s , 7r) is an a priori incentive compatible direct mechanism if it is an a 
priori direct mechanism and it is incentive compatible. 

In an a priori direct mechanism stages 1-3 described in Subsection III-CI can be compressed in 
only one stage in which the intervention device communicates to the users the suggested action 
profile d s and the randomized intervention rule ir. In an a priori incentive compatible direct 
mechanism the incentive compatibility condition must be checked only for users' actions and 
OICDM simplifies irQ 

argmax E E P t (t)ir (f) U (f, d s , t) 

dS ' n teT feT 

subject to: 

tt(/)>0 , 5>(x|t) = l , V/e J 

t-ieT-i feT t-ieT-i fe? 

Vie{l,...,n}, Vr.eT,, V^:A^A (20) 

Definition 5. A randomized intervention rule tt sustains an action profile d G D in V if d is a 

BNE of the game Y, i.e., if Vz G Af , Vr^ G Tj , Vrfj G D it 

E E p *(*i r *M-f i*)^^'*)^ E E^k.x/i^i^^-^) (2d 

14 Notice that the optimal a priori incentive compatible direct mechanism attainable solving J20b is in general suboptimal compared 
to the optimal a priori direct mechanism. In fact, the revelation principle does not hold for a priori mechanisms since we are 
adding an additional constraint, forcing the mechanism to be independent of users' reports. 
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A randomized intervention rule tt sustains an action profile d £ D in T t without intervention if 
7r sustains d and f(d) = d^ for every intervention rule f such that n (/ 1 1) > 0. If such tt exists, 
we say that d is sustainable without intervention. 



If any action profile is sustainable without intervention in T, then (1201) can be decoupled and 
an optimal a priori incentive compatible direct mechanism can be computed as a solution to the 
following unconstrained optimization problem: 

d S = argmax P t (t)U (d*, d s ', t) (22) 
dS ter 

g 

and 7f (/) sustains d in Y without intervention. 

V. Application to flow control 

In this Section we apply the results derived in Sections [III] and [IV] for the abstract framework 
to a concrete scenario: the design of a flow control management system. 

A. Formulation of the flow control problem 

We consider n Poisson streams of packets with arrival rates d\, d 2 , d n that are serviced by a 
single server with exponentially distributed service times with mean -. Since we assume that all 
packets have the same length, we will talk interchangeably of arrival rate (^f-) and transmission 
rate (Mbps), and /i can be seen as the channel capacity, in — , after the server^ We refer to 
each stream of packets as a user. We assume that each user i can control its own traffic (e.g., 
by adjusting the coding quality of its communication), i.e., it can select its transmission rate 
di E Di = [0, fj]. As represented by Fig. HI the system is an M/M/l queue with an input arrival 
rate A = Ya=i d i- 

In most cases a user is faced with two conflicting objectives, i.e., to maximize its throughpu(_^ 
and to minimize its average delay. The conflict between throughput and delay is obvious since as 

l5 We consider packets of the same length to keep a simple notation and because the qualitative results are not affected by this 
hypothesis. However, the model and the analysis can be easily extended to take into account packets of different lengths. 

l6 Here the throughput refers to the traffic the server is able to service, i.e., the transmission rate available to the user, and does 
not take into account the packets lost due to physical layer transmission errors. 
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more traffic enters the server queue the delays become larger. In order to incorporate these two 
measures in a single performance metric, the concept of power has been proposed in ||29l and 
later extended in ll30l . It is defined as the ratio between the throughput and the average delay, 
where the exponent of the throughput is a positive constant. We can therefore write z's utility as 



where d = (dx, . . . , d n ) denotes the transmission rate profile and the parameter U > represents 
user i's type. 

The value of tj may depend, for example, on the quality of service of the application corre- 
sponding to the i-th stream of packets. As we will see in Eqs. (1251) and (|29|) . both considering 
compliant users and strategic users, the rate adopted by a user is increasing in its type. This 
consideration suggests the idea that the higher the type of a user, the higher the importance of 
the rate, with respect to the delay, for that user. As an example, streams of packets associated to 
delay dependent applications should have a low type while streams of packets associated to delay 
tolerant applications should have a high type. 

In general, the applications a server has to deal with may change over time. For this reason it 
is useful to define a common type set T, = {n, r 2 , r m }, m G N, Tk G R, T\ < r 2 < ... < r m , 
Vz G M = {1,2, ...,n}, whose elements represent all the possible types of users a server has to 
deal with. Suppose that at the beginning of the communication a user does not know the types 
of the other users and the intervention device itself does not know the types of the users. We 
assume that a common probability distribution exists and that user types are independent and 
identically distributed (i.i.d.) with P(U) denoting the probability that a user has type U, U G T i? 
and P t (t) = niLi p {ti) tne probability that the type profile is t, t G T = T™. P(U) can be thought 
as the average fraction of applications having type U that require services to the server. 

The network must be designed to operate efficiently following the manager's objective, which 
can be quantified by a utility function. We assume that the manager's utility is the geometric mean 
of the users' utilities: 




(23) 




n 



n 



(24) 
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where = maxji , 0}lZI This choice allows to maintain a balance between two competing 
interests a benevolent manager might have: to maximize the social welfare of the network (defined 
as the sum utility) and to allocate resources fairly, giving to users similar utilities. Notice that 
maximizing Uo(d,t) with respect to users' actions is equivalent to maximizing a proportional 
fairness of users' utilities, i.e., J2i=i m U?~(d, ti), and the optimal solution d* = {d*}™ =l is given 
by (see (H) 

d*(t) = r|£ , (25) 

n + E fc =i h 

We denote by Vi(d(t),ti) and Vo(d(t)) the expected (with respect to the types) utilities of user 
i having type t t and of the manager, where d(t) represents the action adopted by the users when 
the type profile is t, i.e., 

Vi(d(t),U)= Pt(t\U)Ui{d(t),t) (26) 

t-i£T_i 

V (d(t)) = ^P t (t)U {d{t),t) (27) 

teT 

Hence, the maximum expected utility that the manager can obtain when users are compliant is 

V Q ME = Vo(d*(t))- 



B. The flow control games 

In this Subsection we compute the outcome of a flow control problem considering self-interested 
and strategic users, for both the complete and the incomplete information scenarios. Moreover, 
we quantify the loss of efficiency of the manager's utility with respect to the maximum efficiency 
utility. 

17 We consider Uf instead of U% for mathematical reasons, because utilities as defined in Eq. ( 13 1 1 > may also be negative, and 
the geometric mean would lose meaning with negative quantities. Anyway, notice that it is in the self interest of both the users 
and the manager to have A < /i, i.e., working in the sub-space of the original domain such that = Ui. 
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1) The complete information game T®: We define the complete information game 

r°= (AT,A{^(-,t)}f =1 ) (28) 

where each user i selects its action di(t) strategically, knowing the types t of all the users. 
The unique NE df E °(t) of T° t is, Vz G AT, (see 0) 

df E \t) = J* - (29) 

Notice that strategic users use the resources more heavily with respect to compliant users, i.e., 
df E °{t) > d*{t), VieAf and Vt G T (excluding the trivial case n = 1). 

The manager's expected utility in the complete information scenario is equal to V (d NE °(t)). 

2) The Bayesian game T°: We define the incomplete information game 

r° = (A^A^t,{^MKLi) (30) 

where each user i selects its action dj(^) strategically, knowing its own type ti and the probability 
distribution over the types of the other users, P t . 

Proposition 7. There exists a unique Bayesian Nash Equilibrium d BNE (t) of T° which can be 
obtained by solving a linear system Ad BNE = b. In addition, the inverse of A, A -1 , can be 
computed analytically^ 

Proof: See Appendix ■ 
The manager's expected utility in the incomplete information scenario is equal to Vo(d BNE (i)). 

3) Results: Fig. 0] shows the manager's expected utility with respect to the number of users, 
considering fi = 5 Mbps and a type set Tj = {0.1, 1} with uniformly distributed types. The 
upper curve represents the maximum efficiency utility, attainable when users are compliant to 
the manager, while the dashed and the dotted lines represent the manager's utility when users 
are strategic in the complete and incomplete information cases respectively. The manager's utility 
when users act strategically, both for the complete and incomplete information scenarios, is far 
below compared to the maximum efficiency utility. Notice that the manager can obtain a higher 
utility in the incomplete information scenario with respect to the complete information scenario, 

18 The expressions of b, A and A -1 can be found in Appendix Icl 
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at least when there are more than three users in the system. This agrees with the results of OTA . 
Il32ll where, in a strategic setting, the less closely related the agents' goals the lower the quantity 
of information they prefer to exchange. In our case, the objective of the manager becomes less 
closely related to the objective of a single user as the number of total users increases. In fact, the 
manager's objective is to increase the utility of all users in a fair way, while the goal of a user 
is to improve only its own utility, at the cost of the utility of all the other users. Hence, as the 
number of users increases, the selfishness of a single user has a higher negative impact on the 
manager's objective. 

C. The flow control games with intervention 

Fig. |4] shows that the manager's expected utility in strategic settings is much lower that the 
manager's expected utility in cooperative settings, when users are compliant. Here we ask whether 
the manager can design the system in order to make it robust against self-interest strategic users, 
filling, at least partially, the gap between the maximum efficiency utility and the manager's 
expected utility in strategic settings. 

Using the same notations as in Section HU we consider an intervention device that transmits a 
stream of packets to the server with a rate d E D = [0, d^f], following an intervention rule 
/ : D D . d^f represents the maximum rate the intervention device is able to transmit. The 
framework introduced is Subsection IV-AI can be easily modified to take into account the effect of 
the intervention device, that increases the incoming traffic of the server A = YH=\ d% + f(d). The 
users' and the manager's utilities change accordingly: 



Ui(f, d, U) = dt? (ji - A) = d'i L-£>- f( d ) 



■i=i 



U (f,d,t) 



nw.^*i)=o*-A) + n d *" ( 3i ) 

\ i=l i=\ 

It is straightforward to check that the users' and the manager's utilities satisfy assumptions 
A1-A6. In particular, the manager's preferred action is d = (i.e., no intervention), and the 
game T° defined in Subsection III- Al coincides with the game T° defined in Subsection IV-B2I 

In the following we define a simple class of intervention rules, the class of affine intervention 
rules, where the intervention level increases linearly with the users' actions. We limit the design of 
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intervention rules to affine intervention rules, i.e., J 7 coincides with the class of affine intervention 
rules. It may seem restrictive to constrain the intervention device to such a simple class of 
intervention rules. However, under certain conditions, the class of affine intervention rules will 
turn out to be optimal, i.e., it is not possible to increase the manager's utility by expanding the 
intervention rule set T . 



Definition 6. / : D —> D is an affine intervention rule if 



i=l 



(32) 



for certain parameters di > and Ci > 0, where [-] b = min {max {a, •},&}. 



In an affine intervention rule, di represents a target action for user i while c* represents the 
rate of increase of the intervention level due to an increase of z's action. If the action profile d 
is lower than or equal to the target action profile d = \ di, ■ ■ ■ , d n J , then the intervention level is 
equal to 0. If the intervention level is higher than 0, then some user is adopting an action higher 
then the target one. In this case, an increase by an amount e of z's action causes an increase in 
the intervention level by an amount c*e. 

Fig. |5] shows how an affine intervention rule changes the relation between z's utility and z's 
action. The utility of user z is plotted for tree cases: assuming that the intervention device never 
intervenes and assuming that the intervention device adopts a linear intervention rule, for two 
different values of the parameter q. We consider that the other users adopt the target action profile. 
For an action di lower than the target action di, z's utility is as if the intervention device did not 
exist. However, for an action di higher than the target action di, z's utility is lower compared to the 
utility it would have obtained without intervention device, and the gap increases as q increases. 

In the following, we provide the tools for the manager to design the intervention rule, for both 
the complete and the incomplete information scenarios. 

1 ) The complete information game T t : This is the scenario considered in Subsection lII-B[ where 
the users adopt actions strategically, knowing the type of the other users and the intervention rule. 
The interaction between users is modeled with the game T t , 

r t =(Af,D,{U i (f,-,t i )} r > =1 ), (33) 
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where the utilities Ui, i G ftf, are calculated as in Eq. (|3D) . The outcome of such interaction is 
represented by the NE. 

The manager faces the problem of designing a randomized intervention rule so that there exists 
a NE of the game T t that gives it the highest utility among what is achievable with all possible 
NEs. We will see that it can reach this objective considering only pure intervention rules, which 
are simpler to implement than randomized intervention rules. 

Lemma 8. Consider the affine intervention rule f such that, for every user i G M, 



Interpretation: Selecting a q high enough and if the intervention device is able to transmit with 
a large enough transmission rate, the threat of punishment discourages the users from adopting 
actions higher than the target. This situation is shown in Fig. |5]for c, = 2. Hence, if the utility 
of user % is increasing before the target action d{ (in particular, this is valid if di < df E ), as in 
Fig. [5l the target action d { becomes the best response action for user i. 

Proposition 9. If d¥ > — — — , then the optimal strategy profile d*{t), V£ G T, is sustainable 

1 + T i 

without intervention using an affine intervention rule f with d = d*(t) and > n — 1. 

If dy 1 > jJL, then every strategy profile d < d NE °(t), Wt G T, is sustainable without intervention 
using an affine intervention rule f with a high enough c it i G M . 

Proof: First, consider the second affirmation. The condition of Eq. (1341) on d\ l is automatically 
satisfied if the right hand side is lower than 0. Moreover, if it is higher than 0, the right hand side 
is increasing in q. In fact, the function gf(cj) = ^^r ? with a, b > 0, is increasing in q, because 
9'( c i) = (b+dc-) 2 > 0" Thus, the condition of Eq. (1341) on d^ 1 becomes stricter as q increases. 
Taking the limit for q — > +oo we can find the following stricter condition on d^ 1 that does not 




(34) 



If d < d , then f sustains d in T t without intervention. 



Proof: See Appendix ITJ1 
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depend on cf. 

iM u (t* - ELi dk) -di » J (U + l) 

c > — ^ — - — L — = v. - J2 dk - —+ — - < 35 ) 

In order to obtain conditions that are independent of users' types and action profiles to sustain, 
we can consider the following stricter conditions: 

am ^ ,, 5 di (r m + 1) M 

«o > V- 2_^ dk ' d o > A* (36) 

fc=i Tm 

As for Cj, we can find a stricter condition independent of users' types substituting U with r m . 
Thus, once the action profile to sustain is fixed, it is sufficient to select a c, satisfying 

Tm [ fJ> - Y2=l dk) - di 

Ci > ^ ~ (37) 

di 

Now consider the first affirmation. Substituting d*(t) into Eq. (1341) we obtain 

& > — ~ — 1 = n + > t k — 1 = 7-1 — 1 (38) 

As to d^f , substituting d*(t) into the first condition of Eq. ( |36*1 ) we obtain 

<ff >/*-/* ^h tk . = . (39) 

Finally, since the right hand side is decreasing in Ek=i tk, a stricter condition can be obtained 
substituting t k = r 1? \/k G Af, obtaining 

# > (40) 

1 + Ti 



If the intervention device is able to transmit a stream of packets with a rate higher than a certain 
threshold (that is upper-bounded by p), the manager can extract the maximum utility from the 
game T t adopting pure affine intervention rules. The following corollary is an implication of this 
consideration. 

Corollary 10. If dff > — — — , then the class of affine intervention rules is optimal with respect 

1 + Ti 

to T t . 
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Finally, the manager's expected utility for the complete information scenario with intervention 
device, considering affine intervention rules and assuming the condition on d^ is satisfied, is 
equal to the maximum efficiency utility Vo(d*(t)). 

2) The Bayesian game Y: This is the scenario considered in Subsection III-CL where the users 
adopt actions strategically, knowing their own type and the probability distribution over the types 
of the other users. The manager faces the problem of designing a direct mechanism (T, D, d s , 7r) 
so that the outcome of the system maximizes its own utility. The interaction between users is 
modeled with the game T defined by Eq. ©. Since assumptions A1-A6 are satisfied, we exploit 
the results obtained in Section [TIT] for the abstract framework. 

• Existence and calculation of a maximum efficiency incentive compatible direct mechanism. 
We wonder if there are some conditions under which the manager can design a mechanism to 
obtain the same utility it would achieve with compliant users. The following result provides 
an answer to this question. 



Tj+l 




) ' 


( n \ 







> 1 (41) 



Proposition 11. If Vt* G T t and Vt_* e T_ 4 , 

n + Y,^ih + n+i 

then the mechanism (t, D, d S , W j where, Vt G T, 

d S {t)=d*{t) 

lf{-\t) eJ^ 3 ' 1 (42) 
is a maximum efficiency incentive compatible direct mechanism. 

Proof: See Appendix [E] ■ 
• Characterization of the optimal incentive compatible direct mechanism. 

In case a maximum efficiency incentive compatible direct mechanism does not exists, the manager 
faces the problem of designing a mechanism such that it obtains a utility as close as possible to 
the maximum efficiency utility. If d^ 1 > fi, all additional assumptions made in Subsection IIII-BI 
are satisfied. Hence, according to Proposition [51 there exists an optimal incentive compatible direct 
mechanisms such that the intervention device adopts a randomized intervention rule that sustains 
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without intervention the suggested action profile. Such a mechanism can be calculated by solving 
independently Eqs. (TTvT) and (fT8l) . and (fT8l) can be solved considering pure affine intervention 
rules that are simpler to implement than randomized intervention rules. Moreover, according to 
Corollary [6l the class of affine intervention rules is optimal with respect to Y. Unfortunately, 
the solution of Eq. (fT7l) is hard to compute. For this reason, in the following we consider the 
suboptimal mechanisms proposed in Section [IV] 

. Algorithm that converges to an incentive compatible direct mechanism. 
As for the abstract framework in Subsection IIV-AI we consider an algorithm (see Algorithm 
[2k optimized for the flow control scenario, that converges to an incentive compatible direct 
mechanism. 

Algorithm 2 Flow control algorithm. 
i: Initialization: Vt e T, d s (t) = d*(t), ir(f | t) = 1 for a certain / e J 7 ^-* and n(f | t) = 

for / ^ /. 
2: For s — 1 : m 
3: For I — 1 : m 

4: UWi(T a ,T s )<Wi(T a ,n) 

5: d*(n, t-i) min {d*(n, + e i; df B °(r;, tt(/ | t) <- 1 for a certain / e J 7 ^'* 

and tt(/ I t) = for / ^ /, Vt_< G T_ 4 
6: Repeat from 2 until 4 is unsatisfied V s and / 



• A priori mechanism. 

Consider the a priori mechanism where the intervention device, independently of users' types, 
suggests action profile d and adopts the affine intervention rule /, 



d = argmin — In | ji — di J E t 



i=l 



IK 

i=l 



di>0 , d» < (M , \/i eU 



(43) 



Proposition 12. E 1 ^. defines a convex problem if r m < n. Moreover, if the randomized 
intervention rule n sustains d without intervention in T, then (T, D, d, 7r) is an optimal a priori 
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incentive compatible mechanism and the manager's expected utility is Vo(d). 

Proof: See Appendix E ■ 

D. Results 

In the following we are going to quantify the manager's expected utility and the expected 
throughput and delay for each type of user in different scenarios. We consider p, = 5 Mbps and 
a common type set Tj = {0.1, 1}. Except for Fig. [7J we assume that the types are uniformly 
distributed, i.e., -P(O.l) = -P(l) = 0.5, and we plot the results varying the number of users from 
2 to 16. 

We first look at how the manager's expected utility varies increasing the number of users, in 
the complete and incomplete information scenarios. The left side of Fig. prefers to the complete 
information scenario. The overlapped upper lines represent the manager's expected utility when 
users are compliant and when they are strategic with an intervention device that adopts the optimal 
intervention rule derived in Subsection IV-C1I The manager's expected utility is decreasing in the 
number of users because, as the number of users increases, the total congestion experienced 
by every user increases as well. However, it is remarkable that with the intervention scheme the 
manager can completely fill the gap between the maximum efficiency utility and its expected utility 
when the users are strategic but no incentive scheme is adopted (dotted line). The right side of 
Fig. prefers to the incomplete information scenario. In this scenario the manager is guaranteed to 
achieve the maximum efficiency utility using the mechanism derived from the algorithm (dashed 
line) if the number of users is sufficiently small. In fact, for a number of users less than or 
equal to 3, it is straightforward to check that the sufficient condition (|4T|) is satisfied, hence, a 
maximum efficiency mechanism exists and the algorithm converges to it. For a larger number 
of users, there is no guarantee of optimality, and in fact the results of Fig. 4 show that in this 
case the manager's expected utility is lower than what could be obtained with compliant users. 
However, the manager can still considerably increase its expected utility compared to the case of 
strategic users and no incentive scheme (dotted line), by adopting the mechanism derived from 
the algorithm for a number of users lower than 8 and the a priori mechanism (dash-dot line) for 
a number of users greater than or equal to 8 (/ defined in (??) turns out to sustain the solution of 
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(l43l) without intervention in T). It is not surprising that the a priori mechanism is able to obtain 
good performance for a high number of users, in fact in this situation the manager is able to 
foresee more accurately the fraction of users of a certain type, hence the information about users' 
types becomes less important. 

Now we investigate how the results depend on the type probability distribution for the incom- 
plete information scenario. In Fig. [7] we fix the number of users to 4 and we vary the probability 
of the low type, P(0.1), from to 1, which is equivalent to varying P(l) from 1 to 0. We can see 
that the gap between the maximum efficiency utility and the manager's expected utility achievable 
with the mechanism derived from the algorithm is not strongly dependent on the type probability 
distribution. In fact, such a mechanism provides incentives for each type of user to be honest and 
obedient, even though some user types occur rarely. On the contrary, the a priori mechanism is 
strongly dependent on the probability distribution of user types. In fact, the recommended and 
enforced action profile depends exclusively on the type probability distribution. As an example, 
if the low type occurs rarely, the intervention device will suggest to the users to adopt an action 
profile that is close to the objective of the users with high type, that will probably be the majority 
of the users in the network. In the extreme case, if low type users are for sure not present in the 
network (i.e., P(0.1) = 0), than the adopted action profile will maximize the interests of the users 
having high type and the a priori mechanism is able to achieve the maximum efficiency utility. 
Notice that in this situation the manager has no uncertainty about the types of the users in the 
network, which is the reason why it is able to extract the maximum utility. In some sense, the 
uniform probability distribution represents the worst case for the a priori mechanism because the 
manager has the highest uncertainty over the types of the users in the network. 

So far we have only considered the utility as performance indicator. However, the utility includes 
the two real performance metrics, the throughput and the delay. Now we investigate the expected 
throughput and delay achievable with the considered schemes in the complete and incomplete 
information scenarios, for each type of user[|£| Fig. [8] shows the expected throughput (left-side) 

19 Notice that all users in the network experience the same delay. However, such delay depends on the type profile: the higher 
the number of high type users with respect to the number of low type users, the higher the delay. Thus, the expected delay for a 
low type user is lower than the expected delay for a high type user. 
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and delay (right-side) for the complete information scenario. Continuous lines refer to the high 
type users, while dashed lines refer to the low type users. Notice that the high type users obtain a 
higher expected throughput and a higher expected delay compared to the low type users (this will 
be true also for the incomplete information scenario), confirming that the higher the type the higher 
the user's preference for throughput with respect to delay. In both pictures, the upper (continuous 
and dashed) lines refer to the strategic scenario without intervention device, in which the users 
adopt the NE action profile, while the overlapped lower (continuous and dashed) lines represent 
the optimal action policy, obtainable with compliant users or with strategic users subject to the 
intervention rule derived in Subsection IV-C1I With no incentive scheme, strategic users tend to 
overuse the resources of the network, transmitting with higher rates compared to the optimal ones. 
This translates into much higher delays, that increase quickly as the number of users increases. 
Conversely, the optimal transmission policy is such that the expected delay is almost constant with 
respect to the number of users. This means that also the aggregate throughput is almost constant, 
and the rate of each user scales as -. 

n 

Fig. [9] shows the expected throughput (left- side) and delay (right-side) for the incomplete 
information scenario. Continuous lines refer to the high type users, while dashed lines refer 
to the low type users, with the exception of the performance obtainable adopting the a priori 
mechanism, represented by the dash-dot line, in which different types of users adopt the same 
action and experience the same throughput and delay. In both pictures, the upper (continuous and 
dashed) lines refer to the strategic scenario without intervention device, in which the users adopt 
the BNE action profile, while the lower (continuous and dashed) lines represent the optimal 
action policy. The performance obtainable adopting the mechanism derived from the algorithm 
lies between them. The lines that represent the expected delay for the BNE action profile are 
truncated for a number of users equal to 3 and 5 because the system might become unstable. In 
fact, in the BNE the expected utility of a user is maximized, given that the other users adopt the 
BNE. However, for some type profile instances, the utility might be equal to 0, i.e., the delay 
might diverge. Thus, the expected delay diverges as well. In words, there is a positive probability 
that the network becomes congested. The mechanism derived from the algorithm allows to improve 
this situation, limiting the delay experienced by each user. However, such a delay increases almost 
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linearly as the number of users increases. This is the reason why the a priori mechanism, at a 
certain point, even though it is not able to differentiate the service given to different classes of 
traffic, is able to obtain a better performance (from the manager's utility point of view) than the 
mechanism derived from the algorithm, In the a priori mechanism each user, independently of its 
type, adopts a rate which is between the optimal rates adopted by the low type users and the high 
type users, and this situation reflects in the expected delay. This allows to keep a very low and 
constant delay with respect to the number of users. 

VI. Conclusion 

In this paper we extend the intervention framework introduced by [fTTl to take into account 
situations in which users hold relevant information that the manager cannot observe. To design 
a system that is efficient and robust to self-interested strategic users, the manager must provide 
the incentives for the users to report truthfully and to follow the recommendations. For a class 
of environments that includes many resource allocation games in communication networks, we 
provide conditions under which it is possible for the manager to achieve its benchmark optimum 
and conditions under which it is impossible for the manager to achieve its benchmark optimum. 
In both cases, we are able to characterize the optimal coordination mechanism the manager should 
adopt. Although we can characterize the optimal mechanism, we also describe a suboptimal 
mechanism that is easy to compute and a suboptimal mechanism that does not rely on the 
communication between the users and the intervention device. Finally, we apply our framework 
and results to the design of a flow control management system. Computations show that the 
considered schemes can considerably improve the manager's utility. 

Appendix A 
Proof of Proposition CD 

Proof: 

We prove the result by contradiction. 
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(T, D, d s , 7r) is a maximum efficiency incentive compatible direct mechanism. Suppose that 3t 
such that d s (i) ^ d*(i), then 

argmax V (</>*, 5*) = argmax ^ ^ P t (t)vr (/ | t) Z7 < 

< argmax ]T ]T P t (t)n (/ | t) C/ (/, *) + P t (t)U (d* , d*(t),t) < V ME (44) 

Now suppose that 3t such that ir (• | i) If 7r (• | t) sustains d*(£) but 3/ such that 

(j | t ) > and / (<F(f)) ^ d*, then 



7T 



argmax Vb (0*, 5*) = argmax ]T ^ P t (t)7T (/ | t) U (/, d 5 , t) < 

< ^ p t (t)u (d* ,d*(t),t) + (i -tt(/ i i))u (d* ,d*(i),i) + ir(f i i)u (f,d s (i),i) < 



rME 



< Pt(t)U Q (d* Q ,d*(t),t) + (1 - tt(/ I t))U (d*,d*(t),t) + tt(/ I t)U (d*,d*(t),t) = V A 

(45) 

If 7t (■ | t) does not sustain d*(t), then 3z and dj such that f/j (c/*(f),t) < ^dj, d^t), tj. In 
this case the intervention device is not able to provide incentive to user i to adopt optimal strategy 
d*(t) when the type profile is t, therefore the mechanism is not incentive compatible. 

Finally, 2 is a particular case of the incentive-compatibility constraints of OICDM, therefore 
it must be satisfied. 

<= 

It is straightforward to verify that a mechanism satisfying 1 — 4 is incentive compatible and the 
utility of the intervention device is equal to Eq. (TT2|) . 



Appendix B 
Proof of Lemma [4] 

Proof: 

Let (T, D, d s , 71") be an optimal incentive compatible direct mechanism. 
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Given a type profile t, we use the notations 



Df 



a i (t)=E f [U i (f,d s (t),t)] (46) 
We define the function <?i(d_j) in the domain Z?£ 4 as follows: 

^i(d-i) = {d* G -Df such that £/; (d£, d, t) = a { } (47) 

The function <^ is a non-empty set-valued function from Z?£ { to the power set of D^. In fact, 

Vt G T and d_; G D£ t , 

t/i (d*, df n , d-i, *) = < 0i < f/i (d*, df (t), d£i(t), t) < LT< (d*, df (t), d_ l; *) (48) 



The second inequality of Eq. (1481) is valid because z's utility is non increasing with respect to the in- 
tervention level, i.e., U { (/, d 5 (t), t) < U { (d*, d 5 (t), t) , V /, which implies that E f [U { (/, d 5 (t), i)] < 
£/i (dp, d s (i), t) . The last inequality of Eq. (|48T) is valid because z's utility is non increasing in the 
actions of the other users and, from the definition of the set Df, d£j(t) > d_j, Vd_j G £>f. Eq. 
(1481) and the continuity of z's utility imply that an action dj G Z?f satisfying C/j ^dp, dj, d_j, = dj 
exists, Vd„j G Z?£ 4 . Moreover, by definition g t has a closed graph (i.e., the graph of g ri is a closed 
subset of D£j x Z)f) and, since z's utility is non decreasing in d™ m df E °{t) , gi(d-i) is convex, 
Vd_, G £>£«. 

We define the function g{d) = (^ 1 (d_i), • ■ ■ ,g n (d- n )), Vd G -D 5 . g is defined from the non- 
empty, compact and convex set D s to the power set of D s . Thanks to the properties of gi, g has 
a closed graph and g{d) is non-empty and convex. Therefore we can apply Kakutani fixed-point 
theorem [33J to affirm that a fixed point exists, i.e., there exists an action profile d G D s such 
that Ui ^dQ,d, tj = a,i, Vz G Af. For each type profile t G T there exists a different fixed point, 
hence, we use the notation d(t). Notice that d(t) < d NE °(t), therefore the intervention device is 
able to sustain d{t) without intervention. 

Finally, the original optimal mechanism can be substituted by a mechanism where, Vt G T, 
the intervention device suggests d(t) and adopts a randomized intervention rule able to sustain 
it without intervention. In the new mechanism, the users are obedient because the intervention 
rule sustains d(t) and they are honest because the utilities they obtain for each combination 
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of reports are the same as in the original incentive compatible mechanism. The utility of the 
intervention device, which depends only on the users' utilities, is the same as in the original 
mechanism. Therefore we have obtained an optimal incentive compatible direct mechanism where 
the intervention device adopts a randomized intervention rule that sustains without intervention 
the suggested action profile. 



Appendix C 
Proof of Proposition [7] 



Proof: 



Vi(d,U) = E t _ i [Ui(d,U)] = diiUpE^. [Qi - A)] = d^Uf 

dlnV^U) _ Ji 1 

ddi(ti) di(U) n - di(U) - Y%=ij&®tj [djitj)} 
dHnV^U) U 1 



ddf(t v 



dm 



< 



(49) 



Imposing that the first derivative is equal to 0, we obtain that the Bayesian Nash Equilibrium 
d BNE must satisfy, Vi E Af and V/ = 1, . . . , m, 



n m 



(l + r l )df NE (r l )+T l J2P(n)df NE (T k )= m 
The system of equations defined by (1501) can be written as a matrix equation of the form 



(50) 



Ad 



BNE 



(51) 



where 



d 



BNE 



' df NE ' 


jBNE 
i a i 


' df NE (n) ' 


, b = 


h 


, b = 




jBNE 
a n 




_ df" E {r m ) _ 




h 







(52) 
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A T P 

r P A 
r P r P 



r P 
r P 

A 



A = diag (1 + n, . . . , 1 + T m ) , r 



71 



P(n) ... P(r„ 



We want to compute the inverse of the matrix A. We can write A as 





A-r P 




I 


A = 




+ 






A-r P 




I 



r P ... r P 



where I is the identity matrix in ]R mxm . 
The matrix inversion Lemma states that 

(E + BCD) 1 = E" 1 - E _1 B (C^ 1 + DE^B)" 1 DE" 1 

Applying the matrix inversion Lemma to A -1 we obtain 



A" 1 





A-r P 




-l 


A-r P 




-l 


I 


1 _ 




A-r P 






A-r P 




I 



/ 



r P ... r P 



A-r P 



Y 



A 



A-r P 



-1 






I 




I 



(53) 



(54) 



(55) 



(56) 



i -i 



r 



P ... r P 



(57) 



A 
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First, we calculate 



(A 



A" 1 - A -1 ■ t ■ (-1 + P ■ A -1 ■ r) 1 P A" 1 

A -1 - A" 1 ■ r ==- — - r P A 

A' 1 - A' 1 -r ■ (3 -P ■ A' 1 



-i 



where f3 



Now we calculate Y -1 . We rewrite Y as 



t P ... r P 



A-r P 



A-r P 



-1 






I 




I 



1 + 



T 





A" 1 - /3A-VPA- 1 




I 


p 












A" 1 - /JA^VPA- 1 




I 



I + n ■ t ■ P • (A~ x - /3A-VPA- 1 ) = I + r • [n ■ (l - PA _1 r^)] • PA" 1 

I + T ■ ' === — : ~ " PA" 1 



Applying the matrix inversion Lemma to Y" 1 we obtain 



-1 



(58) 



(59) 



r-rv- 



i 



P A 1 I 1 r ) PA 1 I 1 



PA 1 



I- 



n 



-PA 1 



(60) 
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Finally, we can calculate A 1 as 

A- 1 





A-r P 




-i 


A-r P 




-l 


I 


1 




A-r P 






A-r P 




I 



I 



n 



A-r P 



rPA 1 



n -i 



A-r P 



B 



r P ... r P 

C ... C 



B 



C ... C 



(61) 



where 

B = A -1 - /3A _1 rPA _1 
C = (A" 1 - /3A-VPA- 1 ) 



I 



n 



Hence, the BNE can be analytically computed: 

d BNE = A -l b 



rPA 1 ■ rP ■ (A" 1 - /JA^rPA- 1 ) 



(62) 



Appendix D 
Proof of Lemma [8] 

Proof: 

We study the sign of the derivative of z's utility with respect to z' action 

di < di 



dlnUj(f, dj,d-j,t) 
ddi 



U 



1 



di < di < di + 



U_ l + Cj 

di A* - J2k^i dk - di - Ci(di - di, 

U 1 ~ d¥ 

di> di + — 

Ci 



d M 



k di H-Y,kM d k- di~ < 



M 



We denote by df R (d_i) the best response function of user z, i.e., z's action that maximizes 
z's utility when the action vector of the other users is d-i. Since the users' utilities satisfy the 



43 



dUAf di d-i t) 

assumptions A4-A6, — - — — - — > for di < d^ In fact £/;(/, d, ti) is increasing with respect 



0, df R {d- t )j and di < df E ° = df R {d*f) < df R {d.i), where the first inequality is 
an hypothesis of the Lemma and the last inequality is valid because of the submodularity of the 
game. 

• hv dUi(f,dj,d-i,t) , n . ,~ , ^ ,~ 
Imposing the condition — < in di < di < di H , we find 

odi d 



to di in 



Ci > — 7 Tx (63) 



ti [ di di 



The right hand side term of [63] is decreasing in di, therefore the condition is valid in di < di < 

d M 

di H — — if and only if it is valid in di, obtaining 

Ci 

U - ELi dk) - di 
Ci > — ^ ~ J - (64) 

di 

Notice that the condition on c, is a necessary condition for di to be a NE. In fact if it is not 
satisfied then Ui(f,di,d-i,t) is strictly increasing in di and, for the continuity of Ui(f,di,d-i,t) 
with respect to di, we can find an action dj > di such that Ui(f, di, eLj, t) > Ui(f, di, t). 

''t^ < 0in + - 

odi Ci 



■ „ ■ ■ , • dUi(f,di,d_i,t) . , 7 d¥ 
Finally, imposing the condition — < in di > di H , we find 



(65) 



Notice that, given the condition on q, this last condition is sufficient for di to be a global 
maximizer. In fact in this way Ui(f, o^, oLj, i) becomes quasi-concave in d^. increasing for di < d { 
and decreasing for di > di. 



Proof: 



Appendix E 
Proof of Proposition QT] 
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Condition 1, 3 and 4 of [Hare satisfied (we implicitly assume that dff > — - — ). It remains to 

1 + Ti 

verify that 2 is satisfied, i.e., Wj, % G T, u 

pn^L-^4]> Yl ptu&L-^dA (66) 

i-iST-j V fc=l / t-iGT_i V fc=l / 

where, Vj 7^ i, 

4 = ^^ , d, = ^^ , 4- = ^^ , d - 



(67) 



In particular, Eq. (1661) is valid if, Vt_j G T_j, 
Substituting Eq. (1671) into Eq. (1681) we obtain: 

" + g^ ti+f -V + V?y>i (69) 

n + L.k^k + n) \TiJ 

We use the notation a = n + Efc^i *fc> an ^ % — ^. We want to find the condition on T; and x 
such that 

f(x) = ( ^±p^Y +1 x ~n > 1 (70 ) 
Notice that /(l) = 1. We take the derivative of / with respect to x 

(71) 

f{x) >0^ x >a^^>n + J2 k ^t k . 

/(^f) is decreasing in fj until = ^ra + Efc#*fcJ> then it is increasing. This implies that 
for f» < Tj Eq. (1681) is satisfied, i.e., user z has no incentive to report a lower type. However, if 
fj — >■ r^, since /'(I) < 0' men user * ha an incentive to communicate a higher type (this result 
is linked to O. In fact Eq. (1681) is unsatisfied Vt_j G T_j, and therefore Eq. (|67|) is unsatisfied. 
Since the function /(^) increases for fj > Tj + Efc^feJ' me on ly wa Y f° r Eq. (1681) to be 
satisfied is that the function f(x) will eventually reach the value 1 for a value = ^- and all 
the types higher than are higher than the threshold value r th . Notice that it is sufficient that 
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this condition is verified by the type that follows n, i.e., r i+ i. Substituting % with n+i into Eq. 
(1691) we obtain Eq. (l4TT) . 



Appendix F 
Proof of Proposition 

Proof: 

First, we demonstrate that Eq. (|43T) describes a convex problem if r m < n. The constraints 
describe a convex set. We can rewrite the objective function in the following way 



f(d) 



In 



i=l 



n 



i=l 

J7J 



In 



E* liE p (^ 



i=l 



i=l 1=1 



in|/i-E^ -E ln E p ( r 'K ? 

i=l / i=l J=l 

We calculate the partial derivatives of f(d) 



(72) 



En 7 T 



i 



g 2 /(rf) 

"(^-£r=i^) 2 



(E£i *W ft - 1) rf?~ 2 ) (e™i^K") - (YZiPiriYidj- 1 



n\ 2 



d 2 f(d) d 2 f(d) 

We have — 9 > — — - — > 0, where the first inequality is valid if r m < n. 

odj odjddk 

Before concluding, we state and prove the following Lemma. 



(73) 



Lemma 13. The matrix 



H 



ot\ (3 
(3 a 2 




P 



P ... a r , 



(74) 
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where aij > P > 0, \/i — {1, 2, • • • , n}, is positive semidefinite. If the first inequality is strict, it is 
also positive definite. 



Proof: 



H = P 



1 1 ... 1 
1 1 ... 1 

1 1 ... 1 



ai - j3 ... 
a 2 -p ... 



Therefore 



... a n - (3 

/ n N 

v T ■ H ■ v = (cua - 0) v{ + ■ ■ ■ + (a n - /?) v 2 n + P [J2 v i 



(75) 



(76) 



,i=i 



u T • if • v > if <*i > /3 > Vi. w T • # • v > ^ if a; > P > Vi. ■ 
Applying Lemma O to the Hessian of the function f(d) we obtain that the Hessian is positive 

semidefinite, therefore the function f(d) is convex. 

As for the optimality of the a priori incentive compatible mechanism (T, £>, d, 7r), for every a 

priori incentive compatible mechanism we have 



max n4 V (d) = max T)d E t 







E, 


n 

\ 



\U+(f,d,t) 



max | /i 



i=l 



\ * =1 



max | fi 



< maxdKt 



£ di ) E* 

j=i 



\ i=i 



IK 

i=l 



(77) 



Thus if a randomized intervention rule ix sustains without intervention d, the mechanism (T, D,d,n) 
is an optimal a priori incentive compatible direct mechanism. 
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Fig. 1. Interaction between the users and the intervention device 
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Fig. 2. User i's utility vs. user i's action, for different suggested actions 




n 



Fig. 3. Representation of a flow control application as a M/M/l queue 



51 



H = 5,T. = [0.1 1] 
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Fig. 4. Manager's utility as a function of the number of users 




Fig. 5. User i's utility as a function of user i's action for different intervention rules 
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Complete information, (j. = 5 Mbps , T. = [ 0.1 1] 
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Incomplete information, a = 5 Mbps , T. = [ 0.1 1] 
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Fig. 6. Manager's expected utility vs. number of users for the complete and incomplete information scenarios 
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H = 5 , T, = [ 0.1 1] 
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Fig. 8. Total expected throughput and delay vs. number of users for the complete information scenarios 
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Fig. 9. Expected throughput and delay for user vs. number of users for the incomplete information scenarios 



